Abstract In this paper, we study an inexact steepest descent method, with Armijo's rule, for multicriteria
optimization. The sequence generated by the method is guaranteed to be well-defined. Assuming quasi- 
convexity of the multicriteria function we prove full convergence of the sequence to a critical Pareto point. 
As an application, this paper offers a model of self regulation in Psychology, using a recent variational 
rationality approach. 
1 Introduction

The steepest descent method with Armijo's rule for real-valued continuously differentiable optimization problems
(see, for instance, Burachik et al. [8]) generates a sequence such that any accumulation point of it, if any,
is critical for the objective function. This was later generalized to multicriteria optimization by Fliege and
Svaiter [11], namely, to the case where the objective function is vector-valued. Full convergence
for real-valued optimization problems was established when the solution set of the problem is non-empty and the
objective function is convex (see Burachik et al. [8]) or, more generally, quasi-convex (see Kiwiel
and Murty [T5], Bello Cruz and Lucambio Perez [3]). This result has been generalized to convex vectorial
optimization by Graña Drummond and Svaiter [14] (see also Graña Drummond and Iusem [15], Fukuda and
Graña Drummond [12]) and, in the quasi-convex case, to multicriteria optimization by Bento et al. [6] (see
also Bello Cruz et al. [5]). For extensions of other scalar optimization methods to the vectorial setting see,
for instance, [7J QUI HH and references therein. 

As far as we know, Bento et al. [6] presented the first full convergence result for the exact steepest
descent method, with Armijo's rule, for quasi-convex multicriteria optimization, which includes contributions
within the Euclidean and Riemannian contexts; see also Bello Cruz et al. [5]. In the present paper, we study
the method proposed by Fliege and Svaiter [11], which is the inexact version of the method presented in
[6]. This method admits relative errors in the search directions; more precisely, an approximation
of the exact search direction is computed at each iteration. We prove full convergence of the
sequence generated by this inexact method to a critical Pareto point of quasi-convex multicriteria
optimization problems. In particular, we prove full convergence of the sequence to a weak Pareto optimal
point when the objective function is pseudo-convex.

The organization of our paper is as follows. In Section 2 we present the self regulation problem in
the context of Psychology. In Section 3 the multicriteria problem, the first order optimality condition
for it and some basic definitions are presented. In Section 4 the inexact steepest descent method for
finding one solution of multicriteria problems is stated and the well-definedness of the sequence generated
by it is established. In Section 5 a partial convergence result for continuously differentiable multicriteria
optimization is presented without any additional assumption on the objective function. Moreover, assuming
that the objective function is quasi-convex and the Riemannian manifold has non-negative curvature, a
full convergence result is presented. Finally, Section 6 offers a "distal-proximal" model of self regulation in
Psychology, using a recent variational rationality approach ( [551 HTJ HH1 US] ) which models behaviors as an
approach, or avoidance, course pursuit between "desired, or undesired enough" ends, and "feasible enough"
means.

2 The Self Regulation Problem 

In this section, devoted to applications, we direct the attention of the reader to the very important "multiple
goals" self regulation problem in Behavioral sciences. We show the strong link between: i) our paper,
which extends the steepest descent methods of Fliege and Svaiter [11] to the quasi-convex case in multicriteria
optimization, and ii) the "variational rationality" approach of the "theories of change" of Soubeyran
[2"o1 |2"7I |2"5I |2T)]. Change problems consider "why, how, and when" it is worthwhile to move from a bad or
not so good situation $x \in X$ to a better one $y \in X$ (known or unknown), the limit case of full rationality
being an optimizing one, the case of bounded rationality being a better one (in a lot of different formulations,
depending on the context). The variational rationality approach examines two polar kinds of "change
problems", choice and transformation problems: i) adaptive choice problems like the "choosing the context
to choose" problem (the formation of consideration sets), and ii) transformation problems like creation and
destruction, invention, innovation, the evolution of institutions, dynamic interactions (dynamic games),
health, behavioral, organizational and cultural changes, ... in Economics, Decision theory, Management,
Psychology, Artificial Intelligence, Philosophy, Sociology, Applied Mathematics (Variational Analysis,
Optimization and Variational Inequalities). In this "variational context" our present paper shows how setting
joint distal and proximal goals greatly helps to reach a distal goal. It offers an "aspiration driven local search
proximal algorithm". This variational approach emphasizes, at each step of the process, two main variational
principles (among others, in more general settings): a "satisficing but not too much sacrificing" principle
and a "worthwhile to change" principle. Because the state space of situations is the Euclidean space $X = \mathbb{R}^n$,
changes $u = y - x$ from a given situation $x$ to a hopefully better situation $y$ can be characterized by their
directions $v \in X$ and their depth (length) $t > 0$. In this context, where $u = tv$, these two variational principles
specialize to,

i) the choice, at each step, of a "satisficing but not too much sacrificing" direction (a directional "satisficing
but not too much sacrificing" principle);

ii) the choice, at each step, of a "worthwhile change" (a "worthwhile to change" step length principle).

2.1 Self regulation problems

Self regulation considers the systematic activities (efforts) made to direct thoughts, feelings, and actions
towards the attainment of one's goals (Zimmerman [32]). A goal is a conscious or unconscious mental
representation of some future end (to approach or to avoid), more or less distant, abstract (concrete), vague
(precise), desirable and feasible.

Goals can be more or less desirable and more or less feasible. Related to the desirability aspects are
conscious or not, vague or concrete, distal or proximal, long term or short term, extrinsic or intrinsic, set
by others or oneself, individual or collective, learning or performance oriented contents, high or low in
commitment .... Related to the feasibility aspects are importance, priority, urgency, direction, intensity,
difficulty, measurability, ....

Self regulation has two aspects. The positive side of self regulation considers purposive processes where
agents engage in goal-directed actions. It examines goal setting, goal striving and goal pursuit processes.

- Goal setting is the mental process of moving from the consideration of distal goals to the formation
of more proximal goals. Distal goals are desired future ends (visions, imagined desired futures), either
promotion aspirations (like ideals, fantasies, dreams, wishes, hopes and challenges) or prevention aspirations
(like oughts and obligations). They represent desirable but quite unrealistic distal and vague ends (higher
order goals). Proximal goals can be wants, intentions, task goals, i.e., much more feasible but less desirable
intermediate ends (sub goals).

- Goal striving (goal implementation) examines the transition phase between setting a distal goal and
reaching it.

- Goal pursuit (goal revision) focuses on the final phase, after reaching the given goal or failing to reach it.
It examines the role of feedback (self evaluations of successes and failures, including the revision of causal
attributions and self efficacy beliefs, see Tolli and Schmidt [30]) in order to revise goals.

The negative side of self regulation considers what an agent must refrain from doing instead of what he must do
to set and attain some given goal. This negative aspect of self regulation is named self control (overriding of
one action tendency in order to attain another goal). It considers self regulation failures like lack of vision, the
inability to transform unrealistic aspirations into intentions and realistic proximal goals (preparation to action
problems), procrastination and inertia (starting problems), interruptions, distractions, temptations, lack of
feedback, lack of interest, perseverance and motivation (on the track problems) and goal disengagement
(ends problems).

Our paper considers only the positive aspect of self regulation. It focuses on proximal goal setting
activities, examines some aspects of goal revision activities, and does not consider goal striving activities.

2.2 Setting proximal goals 

The Michael Jordan "step by step" principle: The famous basketball player Michael Jordan wrote 
the following about goal setting in his book (Jordan and Miller [E]), "I approach everything step by step 
. . . . I had always set short-term goals. As I look back, each one of the steps or successes led to the next 
one. When I got cut from the varsity team as a sophomore in high school, I learned something. I knew I 
never wanted to feel that bad again .... So I set a goal of becoming a starter on the varsity. That's what I
focused on all summer. When I worked on my game, that's what I thought about. When it happened, I set
another goal, a reasonable, manageable goal that I could realistically achieve if I worked hard enough .... I
guess I approached it with the end in mind. I knew exactly where I wanted to go, and I focused on getting
there. As I reached those goals, they built on one another. I gained a little confidence every time I came
through ...."

Goal hierarchies and goal proximity: the Bandura dual "proximal-distal" self regulation
principle: Bandura [3] argued that people possess multiple systems of goals, hierarchically arranged from
proximal goals to extreme distal goals. Goal proximity defines "how far goals are conceptualized into the
future". A goal hierarchy interconnects at least three levels of goals: peak goals (higher order goals, like
visions, dreams, fantasies, aspirations, ideals, wishes, hopes), distal goals (challenges), and task goals ....
A subset of task goals can be subordinate to distal goals, which can be subordinate to peak goals. Hence,
the proximal goal distinction is relative to the interconnected network of goals (other goals providing the
temporal context). The main point to be emphasized is that distal goals and proximal goals serve different and
complementary conative functions (connected to cognition, affect and motivation) related to goal difficulty,
goal commitment, psychological distance ....

i) distal goals define desired ends (enduring aspirations) that attract individuals; 

ii) proximal goals regulate immediate conative functions, which provide the ways to find and follow a
path of step by step changes moving from the initial situation to approach the desired end or avoid an
undesirable end. In this context it is important to distinguish task goals and strategies. The former
define what is to be accomplished, and the latter define how it is to be accomplished (Wood and
Bandura [31]).

3 The Multicriteria Problem 

In this section, we present the multicriteria problem, the first order optimality condition for it and some 
basic definitions. 

Let $I := \{1, \ldots, m\}$, $\mathbb{R}^m_{+} = \{x \in \mathbb{R}^m : x_j \geq 0,\ j \in I\}$ and $\mathbb{R}^m_{++} = \{x \in \mathbb{R}^m : x_j > 0,\ j \in I\}$. For
$x, y \in \mathbb{R}^m_{+}$, $y \succeq x$ (or $x \preceq y$) means that $y - x \in \mathbb{R}^m_{+}$ and $y \succ x$ (or $x \prec y$) means that $y - x \in \mathbb{R}^m_{++}$.

Given a continuously differentiable vector function $F : \mathbb{R}^n \to \mathbb{R}^m$, we consider the problem of finding an
optimum Pareto point of $F$, i.e., a point $x^* \in \mathbb{R}^n$ such that there exists no other $x \in \mathbb{R}^n$ with $F(x) \preceq F(x^*)$
and $F(x) \neq F(x^*)$. We denote this unconstrained problem as

$$\min_{x \in \mathbb{R}^n} F(x). \tag{1}$$

Let $F$ be given by $F(x) := (f_1(x), \ldots, f_m(x))$. We denote the Jacobian of $F$ by

$$JF(x) := (\nabla f_1(x), \ldots, \nabla f_m(x))^{T}, \qquad x \in \mathbb{R}^n,$$

and the image of the Jacobian of $F$ at a point $x \in \mathbb{R}^n$ by

$$\mathrm{Im}(JF(x)) := \{JF(x)v = (\langle \nabla f_1(x), v\rangle, \ldots, \langle \nabla f_m(x), v\rangle) : v \in \mathbb{R}^n\}.$$

Using the above equality, the first-order optimality condition for Problem (1) (see, for instance, [IT]) is stated
as

$$x \in \mathbb{R}^n, \qquad \mathrm{Im}(JF(x)) \cap (-\mathbb{R}^m_{++}) = \emptyset. \tag{2}$$

Note that the condition in (2) generalizes, to multicriteria optimization, the classical condition "gradient
equals zero" for the real-valued case.

In general, (2) is necessary, but not sufficient, for optimality. A point of $\mathbb{R}^n$ satisfying (2) is called a critical
Pareto point.

4 Inexact Steepest Descent Methods for Multicriteria Problems 

In this section, we state the inexact steepest descent method for solving multicriteria problems, admitting
relative errors in the search directions; more precisely, an approximation of the exact search direction is
computed at each iteration, as considered in [T4l [T2] .

Let $x \in \mathbb{R}^n$ be a point which is not a critical Pareto point. Then there exists a direction $v \in \mathbb{R}^n$ satisfying

$$JF(x)v \in -\mathbb{R}^m_{++},$$

that is, $JF(x)v \prec 0$. In this case, $v$ is called a descent direction for $F$ at $x$.

For each $x \in \mathbb{R}^n$, we consider the following unconstrained optimization problem in $\mathbb{R}^n$:

$$\min_{v \in \mathbb{R}^n} \left\{ \max_{i \in I} \langle \nabla f_i(x), v\rangle + (1/2)\|v\|^2 \right\}, \qquad I := \{1, \ldots, m\}. \tag{3}$$
Lemma 4.1. The following statements hold:

i) The unconstrained optimization problem in (3) has only one solution. Moreover, the vector $v$ is the
solution of the problem in (3) if and only if there exist $\alpha_i \geq 0$, $i \in I(x, v)$, such that

$$v = -\sum_{i \in I(x,v)} \alpha_i \nabla f_i(x), \qquad \sum_{i \in I(x,v)} \alpha_i = 1,$$

where $I(x, v) := \{i \in I : \langle \nabla f_i(x), v\rangle = \max_{i \in I} \langle \nabla f_i(x), v\rangle\}$;

ii) If $x$ is a critical Pareto point of $F$ and $v$ denotes the solution of the problem in (3), then $v = 0$ and the
optimal value associated to $v$ is equal to zero;

iii) If $x \in \mathbb{R}^n$ is not a critical Pareto point of $F$ and $v$ is the solution of the problem in (3), then $v \neq 0$ and

$$\max_{i \in I} \langle \nabla f_i(x), v\rangle + (1/2)\|v\|^2 < 0.$$

In particular, $v$ is a descent direction for $F$ at $x$.

Proof. The proof of the item i can be found in [BJ. For the proof of the remaining items, see, for example, 

m □ 

Remark 4.1. From item i of Lemma 4.1, we note that the solution of the minimization problem (3) is
of the form

$$v = -JF(x)^{T} w, \qquad w = (\alpha_1, \ldots, \alpha_m) \in \mathbb{R}^m_{+}, \qquad \|w\|_1 = 1 \quad (\text{sum norm in } \mathbb{R}^m),$$

with $\alpha_i = 0$ for $i \in I \setminus I(x, v)$. In other words, if $S := \{e_i \in \mathbb{R}^m : i \in I\}$ (the set of the elements of the canonical
basis of the Euclidean space $\mathbb{R}^m$), then $w$ is an element of the convex hull of $S(x, v)$, where

$$S(x, v) := \{u \in S : \langle u, JF(x)v\rangle = \max_{u \in S} \langle u, JF(x)v\rangle\}. \tag{4}$$

Note that the minimization problem (3) may be rewritten as follows:

$$\min_{v \in \mathbb{R}^n} \left\{ \max_{u \in S} \langle u, JF(x)v\rangle + (1/2)\|v\|^2 \right\} = \min_{v \in \mathbb{R}^n} \left\{ \max_{u \in S} \langle JF(x)^{T}u, v\rangle + (1/2)\|v\|^2 \right\}.$$

In view of the previous lemma and (3), we define the steepest descent direction function for $F$ as follows.

Definition 4.1. The steepest descent direction function for $F$ is defined as

$$\mathbb{R}^n \ni x \mapsto v(x) := \operatorname{argmin}_{v \in \mathbb{R}^n} \left\{ \max_{i \in I} \langle \nabla f_i(x), v\rangle + (1/2)\|v\|^2 \right\} \in \mathbb{R}^n.$$

Remark 4.2. This definition was proposed in [11]. Note that, from item i of Lemma 4.1, it follows that
the steepest descent direction for vector functions becomes the classical steepest descent direction when $m = 1$.

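The optimal value associated to $v(x)$ will be denoted by $\alpha(x)$. Note that the function

$$\mathbb{R}^n \ni v \mapsto \max_{i \in I} \langle \nabla f_i(x), v\rangle + (1/2)\|v\|^2 \in \mathbb{R}$$

is strongly convex with modulus $1/2$ and

$$0 \in \partial\left( \max_{i \in I} \langle \nabla f_i(x), \cdot\,\rangle + (1/2)\|\cdot\|^2 \right)(v(x)).$$

So, for all $v \in \mathbb{R}^n$,

$$\max_{i \in I} \langle \nabla f_i(x), v\rangle + (1/2)\|v\|^2 - \alpha(x) \geq (1/2)\|v - v(x)\|^2. \tag{5}$$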
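
For $m = 2$ the direction $v(x)$ and the value $\alpha(x)$ can be computed in closed form. The sketch below is only an illustration and is not taken from the paper: it assumes the two gradients are already available as plain Python lists, it uses the representation $v = -(\lambda \nabla f_1(x) + (1-\lambda)\nabla f_2(x))$, $\lambda \in [0,1]$, from item i of Lemma 4.1, and it picks $\lambda$ so that the convex combination of the gradients has minimal norm (the standard dual characterization of problem (3)). The names `dot` and `steepest_descent_direction_m2` are hypothetical.

```python
from typing import List, Sequence, Tuple

Vector = List[float]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean inner product of two vectors of equal length."""
    return sum(x * y for x, y in zip(a, b))


def steepest_descent_direction_m2(
    g1: Sequence[float], g2: Sequence[float]
) -> Tuple[Vector, float]:
    """Return (v, alpha) for two objectives with gradients g1 and g2 at x.

    v minimizes max_i <g_i, v> + (1/2)||v||^2 and alpha is the optimal value.
    Assumption: m = 2, so the weight vector is (lam, 1 - lam) with lam in [0, 1].
    """
    d = [a - b for a, b in zip(g1, g2)]
    dd = dot(d, d)
    if dd == 0.0:
        lam = 0.5  # both gradients coincide, so every weight gives the same v
    else:
        # Unconstrained minimizer of ||g2 + lam * (g1 - g2)||^2, clipped to [0, 1].
        lam = min(1.0, max(0.0, -dot(d, g2) / dd))
    v = [-(lam * a + (1.0 - lam) * b) for a, b in zip(g1, g2)]
    alpha = max(dot(g1, v), dot(g2, v)) + 0.5 * dot(v, v)
    return v, alpha


# Hypothetical data: gradients (1, 0) and (0, 1) at some point x.
v, alpha = steepest_descent_direction_m2([1.0, 0.0], [0.0, 1.0])
```

A value $\alpha(x) = 0$ signals that $x$ is a critical Pareto point (items ii and iii of Lemma 4.1), so the same routine can serve as a numerical criticality test, up to a tolerance chosen by the user.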

Lemma 4.2. The steepest descent direction function for $F$, $\mathbb{R}^n \ni x \mapsto v(x) \in \mathbb{R}^n$, is continuous. In
particular, the function $\mathbb{R}^n \ni x \mapsto \alpha(x) \in \mathbb{R}$ is also continuous.

Proof. See [6] for the proof of the first part. The second part is an immediate consequence of the first. □

Definition 4.2. Let $\sigma \in [0, 1)$. A vector $v \in \mathbb{R}^n$ is said to be a $\sigma$-approximate steepest descent direction at $x$
for $F$ if

$$\max_{1 \leq i \leq m} \langle \nabla f_i(x), v\rangle + (1/2)\|v\|^2 \leq (1 - \sigma)\alpha(x).$$

Note that the exact steepest descent direction at $x$ is a $\sigma$-approximate steepest descent direction for $F$
with $\sigma = 0$. As an immediate consequence of Lemma 4.1 together with the last definition, it is possible to prove
the following:

Lemma 4.3. Given $x \in \mathbb{R}^n$,

a) $v = 0$ is a $\sigma$-approximate steepest descent direction at $x$ if, and only if, $x$ is a critical Pareto point;

b) if $x$ is not a critical Pareto point and $v$ is a $\sigma$-approximate steepest descent direction at $x$, then $v$ is a
descent direction for $F$.

The next lemma establishes the degree of proximity between an approximate direction $v$ and the exact
direction $v(x)$, in terms of the optimal value $\alpha(x)$.

Lemma 4.4. Let $\sigma \in [0, 1)$. If $v \in \mathbb{R}^n$ is a $\sigma$-approximate steepest descent direction at $x$, then

$$\|v - v(x)\|^2 \leq 2\sigma|\alpha(x)|.$$
Proof. The proof follows from (5) combined with Definition 4.2. See [T3] . □
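
The test in Definition 4.2 and the bound of Lemma 4.4 are easy to evaluate numerically. The following sketch is an illustration under stated assumptions, not the authors' code: the gradients $(1,0)$ and $(0,1)$, the candidate direction and the tolerance `tol` are hypothetical, and for this data the exact direction is $v(x) = (-1/2, -1/2)$ with $\alpha(x) = -1/4$. The function names are likewise made up for the example.

```python
from typing import List, Sequence


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    """Euclidean inner product of two vectors of equal length."""
    return sum(x * y for x, y in zip(a, b))


def is_sigma_approximate(
    grads: Sequence[Sequence[float]],
    v: Sequence[float],
    alpha: float,
    sigma: float,
    tol: float = 1e-12,
) -> bool:
    """Definition 4.2: max_i <grad f_i(x), v> + (1/2)||v||^2 <= (1 - sigma) * alpha(x)."""
    value = max(dot(g, v) for g in grads) + 0.5 * dot(v, v)
    return value <= (1.0 - sigma) * alpha + tol


def within_lemma_bound(
    v: Sequence[float],
    v_exact: Sequence[float],
    alpha: float,
    sigma: float,
    tol: float = 1e-12,
) -> bool:
    """Lemma 4.4: ||v - v(x)||^2 <= 2 * sigma * |alpha(x)|."""
    diff = [a - b for a, b in zip(v, v_exact)]
    return dot(diff, diff) <= 2.0 * sigma * abs(alpha) + tol


# Hypothetical data (m = 2, n = 2).
grads: List[List[float]] = [[1.0, 0.0], [0.0, 1.0]]
v_exact, alpha = [-0.5, -0.5], -0.25
candidate = [-0.6, -0.4]

accepted = is_sigma_approximate(grads, candidate, alpha, sigma=0.5)
close_enough = within_lemma_bound(candidate, v_exact, alpha, sigma=0.5)
```

With these numbers the candidate passes the test for $\sigma = 0.5$ and fails it for $\sigma = 0.2$, which shows how $\sigma$ controls the admissible relative error.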

A particular class of $\sigma$-approximate steepest descent directions for $F$ at $x$ is given by the directions
$v \in \mathbb{R}^n$ which are scalarization compatible, i.e., such that there exists $w \in \operatorname{conv} S$ with

$$v = -JF(x)^{T} w. \tag{6}$$

Note that $w$ determines a scalar function $g(x) := \langle w, F(x)\rangle$ whose steepest descent direction coincides with
$v$, which justifies the name previously attributed to the direction $v$; see [14] for a good discussion.

The next proposition establishes a sufficient condition for $v$, given as in (6), to be a $\sigma$-approximate steepest
descent direction for $F$ at $x$.

Proposition 4.1. Let $\sigma \in [0, 1)$ and $v$ as in (6). If

$$\max_{i \in I} \langle \nabla f_i(x), v\rangle \leq -(1 - \sigma/2)\|v\|^2,$$

or equivalently,

$$\max_{u \in S} \langle JF(x)^{T}u, v\rangle \leq -(1 - \sigma/2)\|v\|^2, \tag{7}$$

then $v$ is a $\sigma$-approximate steepest descent direction for $F$ at $x$.

Proof. See [HI]. □

From Remark 4.1 we note that, for each $x \in \mathbb{R}^n$, the steepest descent direction for $F$ at $x$, $v(x)$, is
scalarization compatible. The next lemma tells us that $v(x)$ satisfies the sufficient condition of the last proposition
with $\sigma = 0$ and, hence, that such a condition is natural.

Lemma 4.5. The following statements hold:

i) $\alpha(x) = -(1/2)\|v(x)\|^2$;

ii) $\max_{u \in S} \langle JF(x)^{T}u, v(x)\rangle = -\|v(x)\|^2$.

Proof. In order to prove item i note that

$$\alpha(x) = \max_{u \in S} \langle JF(x)^{T}u, v(x)\rangle + (1/2)\|v(x)\|^2. \tag{8}$$

Moreover, from Remark 4.1 we have

$$v(x) = -JF(x)^{T}w, \qquad w \in \operatorname{conv} S(v(x)), \qquad S(v(x)) := S(x, v(x)), \tag{9}$$

where $\operatorname{conv} S(v(x))$ denotes the convex hull of $S(v(x))$. So, combining (8) and (9) with the definition of
$S(v(x))$, we get

$$\alpha(x) = \langle JF(x)^{T}u, -JF(x)^{T}w\rangle + (1/2)\|JF(x)^{T}w\|^2, \qquad u \in S(v(x)).$$

Hence,

$$\alpha(x) = \langle JF(x)^{T}w, -JF(x)^{T}w\rangle + (1/2)\|JF(x)^{T}w\|^2,$$

from which item i follows. Item ii is an immediate consequence of item i combined with
(8). □

The inexact steepest descent method with the Armijo rule for solving the unconstrained optimization
problem (1) is as follows:

Method 4.1 (Inexact steepest descent method with Armijo rule).

Initialization. Take $\beta \in (0, 1)$ and $x^0 \in \mathbb{R}^n$. Set $k = 0$.

Stop criterion. If $x^k$ is a critical Pareto point, STOP. Otherwise,

Iterative Step. Compute a $\sigma$-approximate steepest descent direction $v^k$ for $F$ at $x^k$ and the steplength
$t_k \in\, ]0, 1]$ as follows:

$$t_k := \max\left\{ 2^{-j} : j \in \mathbb{N},\ F(x^k + 2^{-j}v^k) \preceq F(x^k) + \beta 2^{-j} JF(x^k)v^k \right\}, \tag{10}$$

and set

$$x^{k+1} := x^k + t_k v^k, \tag{11}$$

and GOTO Stop criterion.

Remark 4.3. The previous method was proposed by Fliege and Svaiter [11] and becomes the classical steepest
descent method when $m = 1$. Other variants of Method 4.1 can be found in [14], [15].
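
The steps of Method 4.1 can be sketched in a few lines. This is a minimal illustration under explicit assumptions, not the authors' implementation: it treats only $m = 2$, it uses the exact direction (the case $\sigma = 0$ of Definition 4.2) computed in closed form as the negative of the minimum-norm convex combination of the two gradients, and it replaces the exact stop criterion by the numerical test $\alpha(x^k) \geq -\texttt{tol}$. The test problem $f_1(x) = \|x - (1,0)\|^2$, $f_2(x) = \|x - (-1,0)\|^2$, the starting point and all function names are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]


def dot(a: Sequence[float], b: Sequence[float]) -> float:
    return sum(x * y for x, y in zip(a, b))


def direction_m2(g1: Sequence[float], g2: Sequence[float]) -> Tuple[Vector, float]:
    """Exact direction v(x) and optimal value alpha(x) for m = 2 (sigma = 0)."""
    d = [a - b for a, b in zip(g1, g2)]
    dd = dot(d, d)
    lam = 0.5 if dd == 0.0 else min(1.0, max(0.0, -dot(d, g2) / dd))
    v = [-(lam * a + (1.0 - lam) * b) for a, b in zip(g1, g2)]
    return v, max(dot(g1, v), dot(g2, v)) + 0.5 * dot(v, v)


def steepest_descent_armijo(
    fs: Sequence[Callable[[Vector], float]],
    grads: Sequence[Callable[[Vector], Vector]],
    x0: Sequence[float],
    beta: float = 0.1,
    tol: float = 1e-10,
    max_iter: int = 500,
) -> Tuple[Vector, int]:
    """Iterate x^{k+1} = x^k + t_k v^k with the Armijo rule (10) applied componentwise."""
    x = list(x0)
    for k in range(max_iter):
        g1, g2 = grads[0](x), grads[1](x)
        v, alpha = direction_m2(g1, g2)
        if alpha >= -tol:  # numerical stand-in for "x^k is a critical Pareto point"
            return x, k
        fx = [f(x) for f in fs]
        slopes = [dot(g1, v), dot(g2, v)]  # components of JF(x^k) v^k
        t = 1.0
        while t > 1e-16:  # guard against endless halving in floating point
            y = [xi + t * vi for xi, vi in zip(x, v)]
            if all(f(y) <= fxi + beta * t * s for f, fxi, s in zip(fs, fx, slopes)):
                break
            t *= 0.5
        x = y
    return x, max_iter


# Hypothetical test problem: squared distances to (1, 0) and (-1, 0).
fs = [lambda x: (x[0] - 1.0) ** 2 + x[1] ** 2, lambda x: (x[0] + 1.0) ** 2 + x[1] ** 2]
grads = [
    lambda x: [2.0 * (x[0] - 1.0), 2.0 * x[1]],
    lambda x: [2.0 * (x[0] + 1.0), 2.0 * x[1]],
]
x_final, iterations = steepest_descent_armijo(fs, grads, [0.5, 2.0])
```

For this test problem the critical Pareto points are the points of the segment joining $(-1, 0)$ and $(1, 0)$, so the returned point should lie on that segment up to the tolerance. An inexact variant would replace `direction_m2` by any routine whose output satisfies Definition 4.2.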



The next proposition ensures that the sequence generated by Method 4.1 is well-defined.

Proposition 4.2. The sequence $\{x^k\}$ generated by the steepest descent method with Armijo rule is well-defined.

Proof. The proof follows from item b of Lemma 4.3 combined with the fact that $F$ is continuously
differentiable. See [11] for more details. □
5 Convergence Analysis

In this section, we present a partial convergence result without any additional assumption on $F$ besides
continuous differentiability. In the sequel, assuming quasi-convexity of $F$ and following the ideas of [14], [6],
we extend the full convergence result presented in [14] to quasi-convex multicriteria optimization. It can be
immediately seen that, if Method 4.1 terminates after a finite number of iterations, then it terminates at a
critical Pareto point. From now on, we will assume that $\{x^k\}$, $\{v^k\}$ and $\{t_k\}$ are infinite sequences generated
by Method 4.1.

To simplify the notation, in what follows we will utilize the scalar function $\varphi : \mathbb{R}^m \to \mathbb{R}$ defined as follows:

$$\varphi(y) = \max_{i \in I} \langle y, e_i\rangle, \qquad I = \{1, \ldots, m\},$$

where $\{e_i\} \subset \mathbb{R}^m$ is the canonical basis of the space $\mathbb{R}^m$. It is easy to see that the following properties of the
function $\varphi$ hold:

$$\varphi(x + y) \leq \varphi(x) + \varphi(y), \qquad \varphi(tx) = t\varphi(x), \qquad x, y \in \mathbb{R}^m,\ t \geq 0. \tag{12}$$

$$x \preceq y \ \Rightarrow\ \varphi(x) \leq \varphi(y), \qquad x, y \in \mathbb{R}^m. \tag{13}$$

5.1 Partial Convergence Result 

The following theorem shows that if $F$ is continuously differentiable then the sequence of the functional
values of the sequence $\{x^k\}$, $\{F(x^k)\}$, is monotonically decreasing and the accumulation points of $\{x^k\}$ are
critical Pareto points. The proof of the next theorem can be found partly in [IT] and pTj. We chose to
present a proof within this paper.

Theorem 5.1. The following statements hold:

i) $\{F(x^k)\}$ is decreasing;

ii) If $\{x^k\}$ has an accumulation point, then $\{t_k^2\|v^k\|^2\}$ is a summable sequence and

$$\lim_{k \to +\infty} t_k\|v^k\|^2 = 0; \tag{14}$$

iii) Each accumulation point of the sequence $\{x^k\}$, if any, is a critical Pareto point.
Proof. The iterative step in Method 4.1 implies that

$$F(x^{k+1}) \preceq F(x^k) + \beta t_k JF(x^k)v^k, \qquad x^{k+1} = x^k + t_k v^k, \qquad k = 0, 1, \ldots. \tag{15}$$

Since $\{x^k\}$ is an infinite sequence, for all $k$, $x^k$ is not a critical Pareto point of $F$. Thus, item i follows
from item b of Lemma 4.3 combined with the last vector inequality.

Suppose now that $\{x^k\}$ has an accumulation point $\bar{x} \in \mathbb{R}^n$ and let $\{x^{k_s}\}$ be a subsequence of $\{x^k\}$ such
that $\lim_{s \to +\infty} x^{k_s} = \bar{x}$. Since $F$ is continuous, we have $\lim_{s \to +\infty} F(x^{k_s}) = F(\bar{x})$. So,
taking into account that $\{F(x^k)\}$ is a decreasing sequence and has $F(\bar{x})$ as an accumulation point, it is easy
to conclude that the whole sequence $\{F(x^k)\}$ converges to $F(\bar{x})$. So, from the definition of the function $\varphi$,
we conclude that $\{\varphi(F(x^k))\}$ converges to $\varphi(F(\bar{x}))$ and, in particular,

$$\varphi(F(\bar{x})) \leq \varphi(F(x^k)), \qquad k = 0, 1, \ldots. \tag{16}$$

From (15), (12), (13) and the definition of $v^k$ (note that $\varphi(JF(x^k)v^k) = \max_{i \in I}\langle \nabla f_i(x^k), v^k\rangle \leq (1-\sigma)\alpha(x^k) - (1/2)\|v^k\|^2$
by Definition 4.2), we obtain

$$\varphi(F(x^{k+1})) \leq \varphi(F(x^k)) + \beta t_k\left( (1 - \sigma)\alpha(x^k) - (1/2)\|v^k\|^2 \right), \qquad k = 0, 1, \ldots,$$

or, equivalently,

$$\varphi(F(x^{k+1})) - \varphi(F(x^k)) \leq \beta\left( (1 - \sigma)t_k\alpha(x^k) - (1/2)t_k\|v^k\|^2 \right), \qquad k = 0, 1, \ldots. \tag{17}$$

Adding the last inequality from $k = 0$ to $n$ and taking into account that $\alpha(x^k) = -|\alpha(x^k)|$, we have

$$\varphi(F(x^{n+1})) - \varphi(F(x^0)) \leq -\beta \sum_{k=0}^{n} \left[ (1 - \sigma)t_k|\alpha(x^k)| + (1/2)t_k\|v^k\|^2 \right].$$

Thus, because $\beta \in (0, 1)$ and $\varphi(F(\bar{x})) \leq \varphi(F(x^{n+1}))$ (see (16)), from the last inequality, we get

$$\sum_{k=0}^{n} \left[ (1 - \sigma)t_k|\alpha(x^k)| + (1/2)t_k\|v^k\|^2 \right] \leq \frac{\varphi(F(x^0)) - \varphi(F(\bar{x}))}{\beta}, \qquad n \geq 0.$$

But this tells us that (recall that $\sigma \in [0, 1)$)

$$\sum_{k=0}^{+\infty} t_k|\alpha(x^k)| < +\infty \quad \text{and} \quad \sum_{k=0}^{+\infty} t_k\|v^k\|^2 < +\infty, \tag{18}$$

from which follows the second part of item ii. The first part of item ii follows from the last inequality
in (18) together with the fact that $t_k \in (0, 1]$, so that $t_k^2 \leq t_k$.
To prove item iii, we assume that $\bar{x}$ is an accumulation point of the sequence $\{x^k\}$ and that $\{x^{k_s}\}$ is a subsequence
of $\{x^k\}$ converging to $\bar{x}$. From Lemma 4.2 we may conclude that $\{v(x^{k_s})\}$ and $\{\alpha(x^{k_s})\}$ converge, respectively,
to $v(\bar{x})$ and $\alpha(\bar{x})$. In particular, from Lemma 4.4 it follows that $\{v^{k_s}\}$ is bounded and, hence, has a convergent
subsequence. Moreover, the sequence $\{t_{k_s}\} \subset\, ]0, 1]$ also has an accumulation point $\bar{t} \in [0, 1]$. We assume,
without loss of generality, that $\{t_{k_s}\}$ converges to $\bar{t}$ and $\{v^{k_s}\}$ converges to some $\bar{v}$. From the equality (14),
it follows that

$$\lim_{s \to +\infty} t_{k_s}\|v^{k_s}\|^2 = 0. \tag{19}$$

We have two possibilities to consider:

a) $\bar{t} > 0$;

b) $\bar{t} = 0$.

Assume that item a holds. Then, from (19), it follows that $\bar{v} = 0$. On the other hand, from Definition
4.2 applied to $v^{k_s}$, we obtain

$$\max_{1 \leq i \leq m} \langle \nabla f_i(x^{k_s}), v^{k_s}\rangle + (1/2)\|v^{k_s}\|^2 \leq (1 - \sigma)\alpha(x^{k_s}), \qquad s = 0, 1, \ldots.$$

Letting $s$ go to $+\infty$ in the above inequality, it follows that $\bar{v} = 0$ is a $\sigma$-approximate steepest descent direction
for $F$ at $\bar{x}$ and, from item a of Lemma 4.3, we conclude that $\bar{x}$ is a critical Pareto point of $F$.

Now, assume that item b holds true. Since $v^{k_s}$ is a $\sigma$-approximate steepest descent direction for $F$ at
$x^{k_s}$ and $x^{k_s}$ is not a critical Pareto point, we have

$$\max_{i \in I} \langle \nabla f_i(x^{k_s}), v^{k_s}\rangle < \max_{i \in I} \langle \nabla f_i(x^{k_s}), v^{k_s}\rangle + (1/2)\|v^{k_s}\|^2 \leq (1 - \sigma)\alpha(x^{k_s}) < 0,$$

where the last inequality is a consequence of item iii of Lemma 4.1. Hence, letting $s$ go to $+\infty$ in the
last inequalities and using that $\{v^{k_s}\}$ converges to $\bar{v}$, we obtain

$$\max_{i \in I} \langle \nabla f_i(\bar{x}), \bar{v}\rangle \leq (1 - \sigma)\alpha(\bar{x}) \leq 0. \tag{20}$$

Take $r \in \mathbb{N}$. Since $\{t_{k_s}\}$ converges to $\bar{t} = 0$, we conclude that if $s$ is large enough,

$$t_{k_s} < 2^{-r}.$$

From (10) this means that the Armijo condition (15) is not satisfied for $t = 2^{-r}$, i.e.,

$$F(x^{k_s} + 2^{-r}v^{k_s}) \not\preceq F(x^{k_s}) + \beta 2^{-r} JF(x^{k_s})v^{k_s},$$

which means that there exists at least one $i_0 \in I$ such that

$$f_{i_0}(x^{k_s} + 2^{-r}v^{k_s}) > f_{i_0}(x^{k_s}) + \beta 2^{-r}\langle \nabla f_{i_0}(x^{k_s}), v^{k_s}\rangle.$$

Since $I$ is finite, passing to a further subsequence if necessary we may take $i_0$ independent of $s$. Letting $s$ go to
$+\infty$ in the above inequality, taking into account that $f_{i_0}$ and $\nabla f_{i_0}$ are continuous and using
that $\{v^{k_s}\}$ converges to $\bar{v}$, we obtain

$$f_{i_0}(\bar{x} + 2^{-r}\bar{v}) \geq f_{i_0}(\bar{x}) + \beta 2^{-r}\langle \nabla f_{i_0}(\bar{x}), \bar{v}\rangle.$$

The last inequality is equivalent to

$$\frac{f_{i_0}(\bar{x} + 2^{-r}\bar{v}) - f_{i_0}(\bar{x})}{2^{-r}} \geq \beta\langle \nabla f_{i_0}(\bar{x}), \bar{v}\rangle,$$

which, letting $r$ go to $+\infty$ and using that $0 < \beta < 1$, yields $\langle \nabla f_{i_0}(\bar{x}), \bar{v}\rangle \geq 0$. Hence,

$$\max_{i \in I} \langle \nabla f_i(\bar{x}), \bar{v}\rangle \geq 0.$$

Combining the last inequality with (20) and taking into account that $\sigma \in [0, 1)$, we have

$$\alpha(\bar{x}) = 0.$$

Therefore, from item iii of Lemma 4.1 it follows that $\bar{x}$ is a critical Pareto point of $F$, and the proof is
concluded. □

Remark 5.1. If the sequence $\{x^k\}$ begins in a bounded level set, for example, if

$$L_F(F(x^0)) := \{x \in \mathbb{R}^n : F(x) \preceq F(x^0)\}$$

is a bounded set, then, since $F$ is a continuous function, $L_F(F(x^0))$ is a compact set. So, item i of Theorem
5.1 implies that $\{x^k\} \subset L_F(F(x^0))$ and consequently $\{x^k\}$ is bounded. In particular, $\{x^k\}$ has at least
one accumulation point.

5.2 Full Convergence 

In this section, under the quasi-convexity assumption on $F$, full convergence of the steepest descent method
is obtained.

Definition 5.1. Let $H : \mathbb{R}^n \to \mathbb{R}^m$ be a vectorial function.

i) $H$ is called convex iff for every $x, y \in \mathbb{R}^n$, the following holds:

$$H((1 - t)x + ty) \preceq (1 - t)H(x) + tH(y), \qquad t \in [0, 1];$$

ii) $H$ is called quasi-convex iff for every $x, y \in \mathbb{R}^n$, the following holds:

$$H((1 - t)x + ty) \preceq \max\{H(x), H(y)\}, \qquad t \in [0, 1],$$

where the maximum is considered coordinate by coordinate;

iii) $H$ is called pseudo-convex iff $H$ is differentiable and, for every $x, y \in \mathbb{R}^n$, the following holds:

$$JH(x)(y - x) \not\prec 0 \ \Rightarrow\ H(y) \not\prec H(x).$$

Remark 5.2. For the first two definitions above see Definition 6.2 and Corollary 6.6 of [23], pages 29 and
31, respectively. For the third definition see Definition 9.2.3 of U3f, page 274. Note that $H$ is convex (resp.
quasi-convex) iff $H$ is componentwise convex (resp. quasi-convex). On the other hand, $H$ componentwise
pseudo-convex is a sufficient, but not necessary, condition for $H$ to be pseudo-convex; see Theorem 9.2.3
of U3f . page 21 A and Remark 5.3. It is immediate from the above definitions that if $H$ is convex then it is
quasi-convex (the converse is clearly false). If $H$ is differentiable, convexity of $H$ implies that for every
$x, y \in \mathbb{R}^n$,

$$JH(x)(y - x) \preceq H(y) - H(x), \tag{21}$$

from which we may conclude that $H$ is pseudo-convex. It is easy to obtain an example showing that the
converse is false.

The next proposition provides a characterization of differentiable quasi-convex functions.

Proposition 5.1. Let $H : \mathbb{R}^n \to \mathbb{R}^m$ be a differentiable function. Then, $H$ is a quasi-convex function if,
and only if, for every $x, y \in \mathbb{R}^n$, it holds

$$H(y) \preceq H(x) \ \Rightarrow\ JH(x)(y - x) \preceq 0.$$

Proof. Let us assume that, for every pair of points $x, y \in \mathbb{R}^n$, it holds

$$H(y) \preceq H(x) \ \Rightarrow\ JH(x)(y - x) \preceq 0. \tag{22}$$

Take $x, y \in \mathbb{R}^n$ and assume that

$$H(y) \preceq H((1 - t)x + ty), \qquad t \in [0, 1).$$

Using (22) with $y = y$ and $x = (1 - t)x + ty$, we obtain $(1 - t)JH((1 - t)x + ty)(y - x) \preceq 0$, which implies

$$\frac{d}{dt} h_i((1 - t)x + ty) = \langle \nabla h_i((1 - t)x + ty), y - x\rangle \leq 0, \qquad i \in \{1, \ldots, m\},$$

where $h_1, \ldots, h_m$ represent the coordinate functions of $H$. But this implies that

$$h_i((1 - t)x + ty) \leq h_i(x), \qquad i \in \{1, \ldots, m\},$$

and, hence, that

$$H((1 - t)x + ty) \preceq H(x) = \max\{H(x), H(y)\},$$

which proves the first part of the proposition. The proof of the second part follows immediately from the
definition of quasi-convexity combined with differentiability of $H$; see [6] for more details. □

From the previous proposition it follows immediately that pseudo-convex functions are quasi-convex. The
converse is naturally false. The next proposition provides a sufficient condition for a differentiable quasi-convex
function to be pseudo-convex.

Definition 5.2. A point $x^* \in \mathbb{R}^n$ is a weak optimal Pareto point of $F$ iff there is no $x \in \mathbb{R}^n$ with $F(x) \prec
F(x^*)$.

Proposition 5.2. Let $H : \mathbb{R}^n \to \mathbb{R}^m$ be a differentiable quasi-convex function. If each critical Pareto point
of $H$ is a weak Pareto optimal point, then $H$ is a pseudo-convex function.

Proof. Take $y \in \mathbb{R}^n$. Since, by hypothesis, each critical Pareto point is a weak Pareto optimal point, if $y$
is a critical Pareto point there is nothing to prove. Let us suppose that $y$ is not a critical Pareto point. Then, there
exists $v \in \mathbb{R}^n$ such that

$$JH(y)v \prec 0. \tag{23}$$

Let us assume, by contradiction, that $H$ is not pseudo-convex. In this case, there exists $x \in \mathbb{R}^n$ such that
$H(x) \prec H(y)$, with

$$JH(y)(x - y) \not\prec 0. \tag{24}$$

From (23) and (24), it follows that

$$JH(y)(x - y) - \beta JH(y)v \not\preceq 0, \qquad \beta > 0. \tag{25}$$

Now, since $H(x) \prec H(y)$, from the continuity of $H$ there exists $\delta > 0$ such that $H(z) \prec H(y)$ for all
$z \in B(x, \delta)$ (ball with center $x$ and radius $\delta$). In particular, $H(x - (\delta/2)(v/\|v\|)) \prec H(y)$ and, because $H$ is
quasi-convex, we obtain

$$JH(y)\left( x - (\delta/2)(v/\|v\|) - y \right) \preceq 0.$$

But this tells us that, with $\beta = \delta/(2\|v\|)$, we have

$$JH(y)(x - y) - \beta JH(y)v \preceq 0,$$

which contradicts (25), and the result is proved. □

Remark 5.3. Consider the following vectorial function $H : \mathbb{R} \to \mathbb{R}^2$ given by $H(t) = (t, -t^3/3)$. Note that
$H$ is not componentwise pseudo-convex because $h_2(t) := -t^3/3$ is not pseudo-convex. However, since $H$ is
quasi-convex and each critical Pareto point of $H$ is a weak Pareto optimal point for $H$, from the last proposition,
it follows that $H$ is pseudo-convex.
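
The claims of Remark 5.3 can be verified by hand. The short computation below is added as an illustration and uses only the definitions of this section; the symbols $s, t, v$ denote arbitrary real numbers.

```latex
% Setting: H(t) = (t, -t^3/3), so that JH(t)v = (v, -t^2 v) for v in R.
\begin{align*}
&\text{Quasi-convexity: } h_1(t) = t \text{ and } h_2(t) = -t^3/3 \text{ are monotone, hence quasi-convex.} \\
&\text{Criticality: } JH(t)v \prec 0 \iff v < 0 \ \text{and}\ -t^2 v < 0 \iff v < 0 \ \text{and}\ t^2 v > 0, \\
&\qquad \text{which is impossible, so } \mathrm{Im}(JH(t)) \cap (-\mathbb{R}^2_{++}) = \emptyset \text{ for every } t \in \mathbb{R}. \\
&\text{Weak optimality: } H(s) \prec H(t) \iff s < t \ \text{and}\ -s^3/3 < -t^3/3 \iff s < t \ \text{and}\ s > t, \\
&\qquad \text{which is impossible, so every } t \in \mathbb{R} \text{ is a weak Pareto optimal point.} \\
&\text{Failure of componentwise pseudo-convexity: } h_2'(0)(1 - 0) = 0 \geq 0 \ \text{but}\ h_2(1) = -1/3 < 0 = h_2(0).
\end{align*}
```

Hence every point of $\mathbb{R}$ is both a critical Pareto point and a weak Pareto optimal point of $H$, which is exactly the hypothesis required by Proposition 5.2.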

We know that criticality is a necessary, but not sufficient, condition for optimality. In [5] the authors
proved that, under convexity of the vectorial function $F$, criticality is equivalent to weak optimality.
Next we prove that the equivalence still holds if $F$ is merely pseudo-convex.

Proposition 5.3. Let $H : \mathbb{R}^n \to \mathbb{R}^m$ be a pseudo-convex function. Then, $x \in \mathbb{R}^n$
is a critical Pareto point of $H$, i.e.,

$$\operatorname{Im}(\nabla H(x)) \cap (-\mathbb{R}^m_{++}) = \emptyset,$$

iff $x$ is a weak optimal Pareto point of $H$.

Proof. Let us suppose that $x$ is a critical Pareto point of $H$. Assume, by contradiction, that $x$ is not a
weak optimal Pareto point of $H$, i.e., that there exists $\tilde{x} \in \mathbb{R}^n$ such that

$$H(\tilde{x}) \prec H(x). \tag{26}$$

As $H$ is pseudo-convex, (26) implies that

$$JH(x)(\tilde{x}-x) \prec 0.$$

But this contradicts the fact that $x$ is a critical Pareto point of $H$, and the first part is concluded. The
second part is a simple consequence of the differentiability of $H$ together with the definitions of critical
Pareto point and weak optimal Pareto point. For more details, see [6]. □

Definition 5.3. A sequence $\{z^k\} \subset \mathbb{R}^n$ is quasi-Fejér convergent to a nonempty set $U$ iff,
for all $z \in U$, there exists a sequence $\{\epsilon_k\} \subset \mathbb{R}_+$ such that

$$\sum_{k=0}^{+\infty} \epsilon_k < +\infty, \qquad \|z^{k+1}-z\|^2 \le \|z^k-z\|^2 + \epsilon_k, \qquad k = 0,1,\ldots.$$

In the next lemma we recall the quasi-Fejér convergence theorem.

Lemma 5.1. Let $U \subset \mathbb{R}^n$ be a nonempty set and $\{z^k\} \subset \mathbb{R}^n$ a sequence that is
quasi-Fejér convergent to $U$. Then, $\{z^k\}$ is bounded. Moreover, if an accumulation point $\bar{z}$ of
$\{z^k\}$ belongs to $U$, then the whole sequence $\{z^k\}$ converges to $\bar{z}$ as $k$ goes to $+\infty$.

Proof. See Burachik et al. [8]. □

Consider the following set

$$U := \{x \in \mathbb{R}^n : F(x) \preceq F(x^k),\ k = 0,1,\ldots\}. \tag{27}$$

In general, the above set may be empty. To guarantee that $U$ is nonempty, an additional assumption
on the sequence $\{x^k\}$ is needed. In the next remark we give such a condition.

Remark 5.4. If the sequence $\{x^k\}$ has an accumulation point, then $U$ is nonempty. Indeed, let $\bar{x}$ be
an accumulation point of the sequence $\{x^k\}$. Then, there exists a subsequence $\{x^{k_j}\}$ of $\{x^k\}$ which
converges to $\bar{x}$. Since $F$ is continuous, $\{F(x^k)\}$ has $F(\bar{x})$ as an accumulation point. Hence,
since $\{F(x^k)\}$ is a decreasing sequence (see item i of Theorem 5.1), the usual arguments easily show that the
whole sequence $\{F(x^k)\}$ converges to $F(\bar{x})$ and the following relation holds:

$$F(\bar{x}) \preceq F(x^k), \qquad k = 0,1,\ldots,$$

which implies that $\bar{x} \in U$, i.e., $U \neq \emptyset$.

Assumption 1. Each $v^k$ of the sequence $\{v^k\}$ is scalarization compatible, i.e., there exists a sequence
$\{w^k\} \subset \operatorname{conv} S$ such that

$$v^k = -JF(x^k)^t w^k, \qquad k = 0,1,\ldots.$$

As observed earlier in the paper, this assumption holds if $v^k = v(x^k)$, i.e., if $v^k$ is the exact steepest
descent direction at $x^k$. We observe that Assumption 1 was also used in [13] to prove the full convergence of
the sequence generated by the method in the case where $F$ is convex. From now on, we assume that
Assumption 1 holds.

In the next lemma we present the main result of this section. It is fundamental to the proof of the global
convergence of the sequence $\{x^k\}$.

Lemma 5.2. Suppose that $F$ is quasi-convex and $U$, defined in (27), is nonempty. Then, for all $\hat{x} \in U$,
the following inequality is true:

$$\|x^{k+1}-\hat{x}\|^2 \le \|x^k-\hat{x}\|^2 + t_k^2\|v^k\|^2.$$

Proof. Consider the hinge $(x^k\hat{x},\, x^kx^{k+1},\, \alpha)$, where $x^k\hat{x}$ is the segment joining $x^k$
to $\hat{x}$; $x^kx^{k+1}$ is the segment joining $x^k$ to $x^{k+1}$ and $\alpha = \angle(\hat{x}-x^k, v^k)$. By
the law of cosines, we have

$$\|x^{k+1}-\hat{x}\|^2 = \|x^k-\hat{x}\|^2 + t_k^2\|v^k\|^2 - 2t_k\|x^k-\hat{x}\|\,\|v^k\|\cos\alpha, \qquad k = 0,1,\ldots.$$

Thus, taking into account that $\cos(\pi-\alpha) = -\cos\alpha$ and
$\langle -v^k, \hat{x}-x^k\rangle = \|v^k\|\,\|x^k-\hat{x}\|\cos(\pi-\alpha)$, the above equality becomes

$$\|x^{k+1}-\hat{x}\|^2 = \|x^k-\hat{x}\|^2 + t_k^2\|v^k\|^2 + 2t_k\langle -v^k, \hat{x}-x^k\rangle, \qquad k = 0,1,\ldots.$$

On the other hand, from Assumption 1, there exists $w^k \in \operatorname{conv} S$ such that

$$v^k = -JF(x^k)^t w^k, \qquad k = 0,1,\ldots.$$

Hence, the last equality yields

$$\|x^{k+1}-\hat{x}\|^2 = \|x^k-\hat{x}\|^2 + t_k^2\|v^k\|^2 + 2t_k\langle JF(x^k)^t w^k, \hat{x}-x^k\rangle, \qquad k = 0,1,\ldots,$$

from which we obtain

$$\|x^{k+1}-\hat{x}\|^2 = \|x^k-\hat{x}\|^2 + t_k^2\|v^k\|^2 + 2t_k\langle w^k, JF(x^k)(\hat{x}-x^k)\rangle, \qquad k = 0,1,\ldots. \tag{28}$$

Since $F$ is quasi-convex and $\hat{x} \in U$, from Proposition 5.1 with $H = F$, $x = x^k$ and $y = \hat{x}$, we have

$$JF(x^k)(\hat{x}-x^k) \preceq 0, \qquad k = 0,1,\ldots.$$

So, because $w^k \in \operatorname{conv} S$, we get

$$\langle w^k, JF(x^k)(\hat{x}-x^k)\rangle \le 0, \qquad k = 0,1,\ldots. \tag{29}$$

Therefore, by combining (28) with (29), the lemma follows. □
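
The one-step inequality of Lemma 5.2 can be tested on a small example. The sketch below is only an illustration under stated assumptions: the bi-criteria function $F(x) = (\|x-a\|^2, \|x-b\|^2)$, the points `A` and `B`, the random sampling and the function name `check_lemma_inequality` are all hypothetical, and $\operatorname{conv} S$ is taken here to be the unit simplex, so that $w = (\lambda, 1-\lambda)$. Both criteria are convex, hence quasi-convex.

```python
import random

# Hypothetical bi-criteria example (not from the paper):
# f1(x) = ||x - A||^2 and f2(x) = ||x - B||^2, both convex, hence quasi-convex.
A = (0.0, 0.0)
B = (1.0, 0.0)

def F(x):
    return tuple((x[0] - p[0])**2 + (x[1] - p[1])**2 for p in (A, B))

def grads(x):
    # Rows of the Jacobian JF(x).
    return [(2.0 * (x[0] - p[0]), 2.0 * (x[1] - p[1])) for p in (A, B)]

def sq_dist(x, y):
    return (x[0] - y[0])**2 + (x[1] - y[1])**2

def check_lemma_inequality(trials=5000, seed=0):
    rng = random.Random(seed)
    checked = violations = 0
    for _ in range(trials):
        xk = (rng.uniform(-3, 3), rng.uniform(-3, 3))
        xhat = (rng.uniform(-3, 3), rng.uniform(-3, 3))
        # Keep only pairs with F(xhat) <= F(xk) componentwise,
        # the one-step version of xhat belonging to U.
        if not all(fh <= fk for fh, fk in zip(F(xhat), F(xk))):
            continue
        lam = rng.random()           # w = (lam, 1 - lam), assumed unit simplex
        t = rng.uniform(0.0, 2.0)    # arbitrary nonnegative step size
        g1, g2 = grads(xk)
        # Scalarization compatible direction v = -JF(xk)^t w.
        v = (-(lam * g1[0] + (1 - lam) * g2[0]),
             -(lam * g1[1] + (1 - lam) * g2[1]))
        x_next = (xk[0] + t * v[0], xk[1] + t * v[1])
        lhs = sq_dist(x_next, xhat)
        rhs = sq_dist(xk, xhat) + t**2 * (v[0]**2 + v[1]**2)
        checked += 1
        if lhs > rhs + 1e-9:         # small tolerance for rounding
            violations += 1
    return checked, violations

checked, violations = check_lemma_inequality()
print(checked, violations)
```

No violation should be reported, in line with (28) and (29): the cross term $2t_k\langle w^k, JF(x^k)(\hat{x}-x^k)\rangle$ is nonpositive whenever $F(\hat{x}) \preceq F(x^k)$.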
Proposition 5.4. If $F$ is quasi-convex, $\mathbb{R}^n$ has non-negative curvature and $U$, defined in (27), is a
nonempty set, then the sequence $\{x^k\}$ is quasi-Fejér convergent to $U$.

Proof. The result follows from item ii of Theorem 5.1 and Lemma 5.2 combined with Definition 5.3. □

Theorem 5.2. Suppose that $F$ is quasi-convex and $U$, as defined in (27), is a nonempty set. Then, the
sequence $\{x^k\}$ converges to a critical Pareto point of $F$.

Proof. From Proposition 5.4, $\{x^k\}$ is quasi-Fejér convergent to $U$. Thus, Lemma 5.1 guarantees that $\{x^k\}$
is bounded and, hence, has an accumulation point $\bar{x} \in \mathbb{R}^n$. From Remark 5.4 we conclude that
$\bar{x} \in U$ and, hence, that the whole sequence $\{x^k\}$ converges to $\bar{x}$ as $k$ goes to $+\infty$ (see
Lemma 5.1). The conclusion of the proof is a consequence of item iii of Theorem 5.1. □

Corollary 5.1. If $F$ is pseudo-convex, $\mathbb{R}^n$ has non-negative curvature and $U$, as defined in (27), is
a nonempty set, then the sequence $\{x^k\}$ converges to a weak optimal Pareto point of $F$.

Proof. Since $F$ is pseudo-convex, and in particular quasi-convex, the corollary is a consequence of the previous
theorem and Proposition 5.3. □

6 Variational Rationality: Inexact Proximal Algorithms as Self 
Regulation Problems 

In this section, we consider an endlessly unsatisfied man who, instead of renouncing, aspires and partially
satisfices, using worthwhile changes.

6.1 Variational rationality 

1) The course between unsatisfied needs, aspirations, and satisfaction levels 

Variational rationality (Soubeyran [26, 27, 28, 29]) is a purposive and dynamic approach to behavior. It
models the course pursuit between desired ends and feasible means. It is a theory of the endlessly unsatisfied
man who, given many unsatisfied needs, both renounces satisfying some of them and aspires to satisfice
some others. Let us summarize some of the main points of this conative approach to behavior, based on
cognition (knowledge), motivation (desires) and affect (feelings and emotions):
i) The agent, focusing his attention on the unsatisfied needs he has chosen to satisfice, considers desired
ends. He forms aspirations (distal goals). Setting aspirations is a way to know what he really wants
among all his wishes, without considering whether they are realistic or not.

ii) Then, the agent starts to consider feasible means (defined as the means he must find, build, gather and
learn how to use).

iii) Given the difficulty of gathering such feasible means, the agent chooses to partially satisfice his aspirations.

Then, the agent self-regulates all his goal-oriented activities:

iv) Goal setting: setting proximal goals is a way for him to divide the difficulty and to better know what he can
really do. This allows him to balance between "desired enough" ends and "feasible enough" means.

v) Goal striving represents the path (way of doing, strategy) the agent chooses to follow and the obstacles
he must overcome to attain his successive proximal goals and partially satisfice.

vi) Goal pursuit is the revision of his goals, using feedback coming from successes and failures.

This variational approach is progressive (adaptive). The step-by-step joint formation of distal (global)
and proximal (local) goals and related actions is a process including many interactions, tâtonnements,
adjustments, ..., driven by inexact perceptions, evaluations and judgments.

Among several variational principles, three are worth mentioning in the present paper:

- the "satisficing with not too much sacrificing" principle and the "worthwhile to change" principle
(Soubeyran [26, 27]);

- the "tension reduction-tension production" principle (Soubeyran [28, 29]).

2) Unsatisfied needs, aspirations, satisfaction and satisfying levels, and aspiration gaps 

In the specific case of this paper, let us model the main motivational concepts of the variational approach
of Soubeyran [26, 27]. They include:

a) The map of unsatisfied needs (the needs system). An agent has two ways to perceive, judge
and estimate a situation, either in terms of dissatisfaction or in terms of satisfaction. Usually an agent deals
with many unsatisfied needs which depend on his present situation $x \in X$. Let $I = \{1,2,\ldots,m\}$ be the list
of different potential needs. For the perceived unsatisfied needs functions, let $0 \le n_i(x) < +\infty$ be the
strength of each perceived need $i \in I$ in each situation $x \in X$. Let $N : X \to \mathbb{R}^m$, given by

$$N(x) = (n_1(x), n_2(x), \ldots, n_i(x), \ldots, n_m(x)),$$

be the map of unsatisfied needs in this situation. These needs can be rather vague and abstract. Hull [17]
and Murray [53] give an extensive list of different needs.

b) The map of aspiration gaps. As soon as the agent chooses not to renounce satisfying, at least
partially, all these unsatisfied needs, they become, in this present situation, aspiration gaps $a_i(x)$, $i \in I$,
although in general the perceived aspiration gaps $a_i(x) \ge 0$, $i \in I$, are lower than the perceived unsatisfied
needs: $0 \le a_i(x) \le n_i(x)$, $i \in I$, because agents usually aspire to fill no more than their unsatisfied needs.

c) The map of aspiration levels (desirable ends, or the distal goal system). Let us denote
by $\bar{g}_i(x)$, $i \in I$, the aspiration levels for $x \in X$. They represent still vague, abstract, and non-committed
higher order goals (visions, ideals, aspirations, fantasies, dreams, wishes, hopes, wants, and desires). These
aspiration levels represent desirable (but perhaps unrealistic) ends. Lewin [21] defines aspiration levels as
desirable ends, some being unrealistic in the near future, and others not.

d) The map of satisfaction levels (the experienced utility system). Most of the time unsatisfied
needs are partially satisfied. Let $G : X \to \mathbb{R}^m$, $G(x) = (g_1(x), g_2(x), \ldots, g_i(x), \ldots, g_m(x))$, be the
map of present satisfaction levels (or outcomes) in the present situation $x$, where $g_i(x) \le \bar{g}_i(x)$, $i \in I$,
i.e., the levels at which all needs are partially satisfied.

e) The map of discrepancies (the drive system). The differences between aspiration levels and
satisfaction levels define more precisely the aspiration gaps $a_i(x) = \bar{g}_i(x) - g_i(x) \ge 0$, $i \in I$, which
are nonnegative. We will assume in this paper that aspiration gaps are equal to unsatisfied needs, i.e., they
represent the discrepancies

$$f_i(x) = n_i(x) = a_i(x) = \bar{g}_i(x) - g_i(x) \ge 0, \qquad i \in I,\ x \in X,$$

because, usually, agents aspire to satisfy their perceived unsatisfied needs, even if, in a second stage, they
do not intend to satisfy all of them. The crude perception of these gaps generates feelings and
emotions, the so-called drives (Hull [17]).

We consider here satisfaction levels $g_i(x)$, $i \in I$, instead of discrepancies. Moreover, for simplification,
we consider all aspiration levels as constant, i.e., $\bar{g}_i(x) = \bar{g}_i < +\infty$, $i \in I$, $x \in X$. Then,

$$f_i(x) = \bar{g}_i - g_i(x) \ge 0 \quad \text{and} \quad g_x(v) = -f_x(v), \qquad i \in I,\ x \in X.$$

The main problem is to know how the agent sets all these levels, step by step (progressively).

3) Feasible means and the "goal system"
A "goal system" (Kruglanski et al. [20]) comprises i) a cognitive network of mental representations of
goals and means which are structurally interconnected by several more or less strong cognitive links, ii) in
the short run, a subset of limited available resources (physiological, material, mental, social means), because
means are scarce and difficult to obtain, iii) an allocation process where goals compete for the use of these
limited available resources, iv) a motivational process of goal setting, goal commitment, goal striving and
goal pursuit (using affective feedback engendered in response to success and failure outcomes, goal revision
including persistence of pursuit, means substitution and the management of goal conflict). In our specific
case the goal system is the satisfaction map

$$G(x) \in \mathbb{R}^m.$$

Available means are identified with the situation $x \in X$. These means can represent actions, resources and
capabilities "to be able to do" them (see Soubeyran [26, 27]). The fact that outcomes compete for restricted
means can be modeled as follows: we decompose the given $x$ into the sum $x = x_1 + x_2 + \ldots + x_i + \ldots + x_m$,
where $x_i \in X$ is the bundle of means allocated to goal $i$ and $g_i = g_i(x_i)$ is the level of satisfaction of this
objective.

6.2 The proximal "satisficing-but not too much sacrificing" principle 

The local evaluation of marginal satisfaction levels of change. Starting from the situation $x \in X$,
let $v \in X$ be a direction of change, $t > 0$ be the intensity of change, $u = y - x = tv$ be the change and
$G_x(v) = JG(x)v$ be the vector of marginal satisfaction levels of change. The related needs may
have different degrees of importance and urgency and, at each step, the agent must weight each of them to
define priorities. This task (solving trade-offs) is not easy and must be done progressively. Define
$g_x : \mathbb{R}^n \to \mathbb{R}$, given by

$$g_x(v) := \min_{i \in I} \langle \nabla g_i(x), v \rangle = \min_{i \in I} (JG(x)v)_i,$$

the marginal satisfaction function. It represents the minimum of the different marginal satisfaction levels
$(JG(x)v)_i$, $i \in I$. Considering this marginal satisfaction function avoids having to choose weights for each
marginal satisfaction level and to adapt them at each step.

Taking care of exploration costs. Let situations such as $x \in X$ represent means which generate the
vector of satisfaction levels $G(x)$. The agent, in situation $x \in X$, considers (explores) new situations
$y = x + tv$, $t > 0$, $v \in X$. This global exploration process is costly. Let us define the local consideration costs
(search costs, exploration costs) $c_x(v) = (1/2)\|v\|^2 \ge 0$. The choice of a quadratic function models the
case where local exploration is not too costly: consideration costs $c_x(v) \ge 0$ are large "in the large" and
small "in the small". Notice that, while the agent considers feasible directions of change over the whole state
space, he takes care of consideration costs (a local aspect).

The local search of directions of aspirations. In general (Soubeyran [27]) the proximal payoff
balances desired ends and feasible means. In this paper the proximal payoff $l_x(v) = g_x(v) - c_x(v)$ balances
the marginal satisfaction levels and the costs to consider them (local exploration costs). Since

$$l_x(v) \ge 0 \iff g_x(v) \ge c_x(v),$$

we will say that it is "worthwhile to explore in direction $v$, starting from $x$" whenever $l_x(v) \ge 0$, because
the marginal satisfaction level $g_x(v)$ in this direction is then at least as high as the costs $c_x(v)$ of
considering it. For each $x \in X$ consider the local search proximal problem: find a direction of change
$v(x) \in X$ such that $l_x(v(x)) = \sup\{l_x(v) : v \in X\}$. Let $\bar{l}_x = \sup\{l_x(v) : v \in X\}$ be the optimal
proximal payoff at $x$ and $v(x) = \operatorname{argmax}\{l_x(v) : v \in X\} \in X$ be the unique optimal direction of
change, starting from $x$. Then, $\bar{l}_x = l_x(v(x))$. From Lemma 4.1 and Lemma 4.2 it follows that

1) If $x \in X$ is Pareto critical, then $v(x) = 0 \in X$ and $\bar{l}_x = 0$.

2) If $x \in X$ is not Pareto critical, then $\bar{l}_x > 0$ and $(JG(x)v(x))_i \ge g_x(v(x)) > c_x(v(x))$, $i \in I$.

3) The mappings $X \ni x \mapsto v(x) \in X$ and $X \ni x \mapsto \bar{l}_x \in \mathbb{R}$ are continuous.
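
For two criteria in the plane, the local search proximal problem can be solved in closed form, which gives a convenient way to see items 1) and 2) at work. The following is a minimal sketch, not the authors' implementation: it relies on the standard duality fact that the maximizer of $\min_i \langle \nabla g_i(x), v\rangle - (1/2)\|v\|^2$ is the minimum-norm convex combination of the gradients, and the satisfaction map $g_i(x) = -\|x - p_i\|^2$, the points and all function names are hypothetical.

```python
import random

def dot(a, b):
    return a[0] * b[0] + a[1] * b[1]

def proximal_payoff(g1, g2, v):
    # l_x(v) = min_i <grad g_i(x), v> - (1/2) ||v||^2
    return min(dot(g1, v), dot(g2, v)) - 0.5 * dot(v, v)

def aspiration_direction(g1, g2):
    # Minimum-norm point of the segment [g1, g2]; for two criteria this is
    # the maximizer of the proximal payoff (assumed duality argument).
    d = (g1[0] - g2[0], g1[1] - g2[1])
    dd = dot(d, d)
    lam = 0.5 if dd == 0.0 else min(1.0, max(0.0, -dot(g2, d) / dd))
    return (lam * g1[0] + (1 - lam) * g2[0],
            lam * g1[1] + (1 - lam) * g2[1])

def satisfaction_gradients(x, a=(0.0, 0.0), b=(1.0, 0.0)):
    # Hypothetical satisfaction levels g_i(x) = -||x - p_i||^2.
    return tuple((-2.0 * (x[0] - p[0]), -2.0 * (x[1] - p[1])) for p in (a, b))

# A non-critical situation: positive aspiration level, nonzero direction.
g1, g2 = satisfaction_gradients((0.5, 1.0))
v_star = aspiration_direction(g1, g2)
l_star = proximal_payoff(g1, g2, v_star)

# Crude sanity check: no randomly sampled direction beats v_star.
rng = random.Random(1)
best_sampled = max(
    proximal_payoff(g1, g2, (rng.uniform(-4, 4), rng.uniform(-4, 4)))
    for _ in range(10000)
)

# A Pareto critical situation: the direction (and the payoff) vanish.
c1, c2 = satisfaction_gradients((0.5, 0.0))
v_crit = aspiration_direction(c1, c2)

print(v_star, l_star, best_sampled, v_crit)
```

At $x = (0.5, 1)$ the sketch returns the direction $(0, -2)$ with payoff $2 = (1/2)\|v(x)\|^2$, while at the critical point $(0.5, 0)$ the direction is zero.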

Starting from $x$ and using variational rationality concepts, the optimal direction of change $v(x)$ defines the
unique direction of aspiration and the optimal proximal payoff $\bar{l}_x = l_x(v(x))$ defines the proximal aspiration
level (net of consideration costs). From Lemma 4.1 and Lemma 4.2 it follows that

1) If $x \in X$ is Pareto critical, then the direction of aspiration and the proximal aspiration level are zero.

2) If $x \in X$ is not Pareto critical, then there is a nonzero direction of aspiration and the proximal
aspiration level is strictly positive.

3) Directions of aspiration and proximal aspiration levels are continuous.

The local determination of a local satisficing direction of change. If $x \in X$ is not Pareto critical:

i) Set a local (net) satisficing level of change $\tilde{l}_x = \sigma \bar{l}_x$, $0 < \sigma < 1$, which is positive and
strictly lower than the local aspiration level of change $\bar{l}_x > 0$. As a "variational rationality" concept
(Soubeyran [26, 27]), this local satisficing level of change $\tilde{l}_x$ is situation dependent (it changes with
$x \in X$). Simon [25], the father of the satisficing concept, defines an invariant satisficing level without any
reference to an aspiration level of change $\bar{l}_x$.
ii) Then, the agent will try to find a direction $v^s \in X$ such that $l_x(v^s) \ge \tilde{l}_x$, where $\tilde{l}_x$ is
the satisficing level set in i). This means that the satisficing direction of change $v^s$ "improves enough" with
respect to the aspiration direction of change $v(x)$, including exploration costs. In variational terms, such a
direction not only satisfices (Simon [25]) but, even more, balances satisficing ("improving enough") marginal
satisfactions to change $g_x(v)$ with some sacrifices to change $c_x(v)$, because the net satisfaction level
$l_x(v^s)$ is at least as high as the (net) satisficing level. This is a local version of the variational
"satisficing with not too much sacrificing" principle (Soubeyran [26, 27]). This is equivalent to saying that the
satisficing direction of change $v^s \in X$ is an inexact solution of the local search proximal problem. In this
context proximal goals are local aspiration levels of change and satisficing levels of change.
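
The acceptance test for a satisficing (inexact) direction can be sketched in a few lines. This is an illustration under assumptions, not the paper's procedure: the two gradients, the candidate directions, the tolerance value `sigma = 0.7` and the function names are hypothetical. For the chosen gradients the optimal direction is $(0, -2)$, the minimum-norm convex combination of the two gradients, with net payoff $2$.

```python
# Hypothetical data: gradients of two satisfaction levels at a non-critical x.
GRADS = ((-1.0, -2.0), (1.0, -2.0))

def proximal_payoff(v, grads=GRADS):
    marginal = min(g[0] * v[0] + g[1] * v[1] for g in grads)   # g_x(v)
    cost = 0.5 * (v[0]**2 + v[1]**2)                            # c_x(v)
    return marginal - cost                                      # l_x(v)

def is_satisficing(v, aspiration_level, sigma):
    # Accept v when its net payoff reaches the fraction sigma
    # of the aspiration level (the optimal net payoff).
    return proximal_payoff(v) >= sigma * aspiration_level

# Aspiration level: payoff of the optimal direction (0, -2) for these gradients.
aspiration_level = proximal_payoff((0.0, -2.0))
sigma = 0.7   # assumed tolerance, 0 < sigma < 1

exact_ok = is_satisficing((0.0, -2.0), aspiration_level, sigma)
inexact_ok = is_satisficing((0.3, -1.5), aspiration_level, sigma)
poor_ok = is_satisficing((1.0, -1.0), aspiration_level, sigma)

print(aspiration_level, exact_ok, inexact_ok, poor_ok)
```

The exact direction and the nearby direction $(0.3, -1.5)$ pass the test, whereas $(1, -1)$ has zero net payoff and is rejected: it sacrifices too much relative to the satisficing level.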

Remark 6.1. In terms of variational rationality, the Fliege-Svaiter [11] steepest descent method appears to
be an "aspiration driven local search proximal algorithm". In situation $x \in X$, the distal goal is the aspiration
level of change $\bar{l}_x$ and the proximal goal is the satisficing level $\tilde{l}_x$.

6.3 The proximal "worthwhile to change" principle and goal difficulty 

Inertia matters because being able to change from some situation $x \in X$ to a new improving situation $y \in X$
is costly. As variational concepts, there are two kinds of "costs to be able to change" (Soubeyran [26, 27]): i)
consideration costs (perception, exploration, search and evaluation costs, ...), ii) capability costs to change
$C(x,y)$, i.e., the costs to be able to change (to be able to stop using old means, to be able to use old means
again, and to be able to imagine, find, build, gather, and learn how to use new means). Means can be
capabilities (competences, skills), ingredients and resources. In the present paper consideration costs are
$c_x(v) = (1/2)\|v\|^2$ and capability costs to change are $C(x,y) = K[tJG(x)(y-x)] = tJG(x)(y-x)$, with
$t > 0$. This formulation, specific to the present paper, means that costs to change from $x$ to $y$ increase with
the difficulty to change, modeled here as the vector of gradients $\Delta(x,y) = tJG(x)(y-x)$, including the
step length of change $t > 0$. Then, the second variational principle tells us that it is "worthwhile to change"
from $x$ to $y$ if advantages to change $A(x,y) = G(y) - G(x)$ are higher than some proportion, $\beta > 0$, of costs
to change, i.e., $G(y) - G(x) \succeq \beta C(x,y)$, where $\beta > 0$ is a rate of tolerance which calibrates how
acceptable the change (transition) $x \curvearrowright y$ is. More generally (Soubeyran [26, 27]), it is "worthwhile
to change" from $x$ to $y$ if motivations to change $M(x,y) = U[A(x,y)]$ are higher than some proportion,
$\beta > 0$, of resistances to change $R(x,y) = \Lambda[C(x,y)]$, where $U(\cdot)$ and $\Lambda(\cdot)$ are the experienced
utility and disutility of advantages and costs to change.
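
One way to read the "worthwhile to change" test is as a backtracking rule of Armijo type on the intensity of change $t$. The sketch below rests on an interpretive assumption: the cost of the change $y = x + tv$ is taken to be $tJG(x)v$, so that the test becomes $G(x+tv) - G(x) \succeq \beta\, tJG(x)v$. The satisfaction map, the points, the halving strategy and all parameter values are hypothetical.

```python
# Sketch of a backtracking "worthwhile to change" test (Armijo-type rule).
# The satisfaction map G, the points and all parameter values are hypothetical.

P = ((0.0, 0.0), (1.0, 0.0))   # ideal points of two hypothetical goals

def G(x):
    # Satisfaction levels g_i(x) = -||x - p_i||^2.
    return tuple(-((x[0] - p[0])**2 + (x[1] - p[1])**2) for p in P)

def JG_times(x, v):
    # Directional derivatives JG(x) v, one per goal.
    return tuple(-2.0 * ((x[0] - p[0]) * v[0] + (x[1] - p[1]) * v[1]) for p in P)

def worthwhile(x, v, t, beta):
    # Advantages G(y) - G(x) must dominate beta * t * JG(x) v in every component
    # (assumed reading of the cost to change).
    y = (x[0] + t * v[0], x[1] + t * v[1])
    gains = [gy - gx for gy, gx in zip(G(y), G(x))]
    costs = [t * d for d in JG_times(x, v)]
    return all(gain >= beta * cost for gain, cost in zip(gains, costs))

def backtrack(x, v, beta=0.1, t=1.0, max_halvings=30):
    # Halve the intensity of change until the change becomes worthwhile.
    for _ in range(max_halvings):
        if worthwhile(x, v, t, beta):
            return t
        t *= 0.5
    return None

step = backtrack((0.5, 1.0), (0.0, -2.0))
print(step)
```

With these data the full step $t = 1$ overshoots and brings no advantage, so it is rejected, while $t = 0.5$ is accepted: the agent makes a smaller, worthwhile change.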
The variational concept of "worthwhile changes" is related to the famous Lindblom [35] "muddling
through" economizing principle, where agents make small steps (incremental changes, choosing the step
size in our context) and successive limited comparisons (balancing pros and cons).

Remark 6.2. Our paper considers quasi-convex (or quasi-concave) payoffs. In terms of variational
rationality, this case is very interesting, because it allows large flat portions which can be very costly to
explore (quadratic exploration costs are "large in the large"). Hence, in this case, convergence is a very nice
result.

6.4 Local exploration traps 

The goal of this paper has been to give conditions for the convergence of a path of change towards a Pareto
optimum in a multicriteria optimization setting. The variational concept of a behavioral trap (Soubeyran
[26, 27]) appears in this context at the local level of the consideration (say, exploration) process. More
precisely, we will say that $x^* \in X$ is a local exploration trap if

$$l_{x^*}(v) = g_{x^*}(v) - c_{x^*}(v) \le 0, \qquad v \in X.$$

This means that, locally, it is not worthwhile to explore because, whatever the direction of change $v \in X$,
the marginal advantages to change $g_{x^*}(v)$ are not higher than the local exploration costs to change
$c_{x^*}(v)$. Lemma 4.1 shows that if $x^* \in X$ is Pareto critical, then $x^*$ is a local exploration trap.

7 Final Remarks 

We proved full convergence of the sequence generated by this inexact method to a critical Pareto point
of quasi-convex multicriteria optimization problems. We also showed a striking result, namely the
strong connection of such an inexact proximal algorithm with the self-regulation problem in Psychology.
Further research can be carried out in this direction.